ACTGTCTGGAACTGGACTGAGTCACCAAAAGGCGAATGGCTTCATCTTATAAAATGTCTGAACAAAGCACA 
ACTTCTGAGCACATTTTACAGAAGACATGTGATCACCTGATCCTGACTAACCGTTCTGGATTAGAGACAGA 
CTCAGTAGCAGAGGAAATGAAGCAGACTGTGGAGGGACAGGGGCATACAGTGCACTGGGCAGCTCTCCTGA 
TACTCGCGGTGATAATACCCACCATTGGTGGGAACATCCTTGTGATTCTGGCTGTTGCACTGGAGAAAAGG 
CTGCAGTACGCTACCAACTACTTTTTAATGTCCTTGGCGATAGCAGATTTGCTGGTTGGATTGTTTGTGAT 
GCCGATTGCCCTCTTGACAATCATGTTTGAGGCTATATGGCCCCTCCCACTGGCCCTGTGTCCTGCCTGGT 
TATTCCTCGATGTTCTCTTTTCAACTGCCTCCATCATGCATCTCTGTGCCATTTCCCTGGACCGCTATATA 
GCCATCAAAAAGCCAATTCAGGCCAATCAGTGCAACACCCGGGCTACTGCATTCATCAAGATTACAGTGGT 
ATGGTTAATTTCAATAGGCATCGCCATCCCAGTCCCTATTAAAGGAATCGAGACTGATGTGATTAATCCAC 
ACAATGTCACCTGTGAGCTGACAAAGGACCGCTTTGGCAGTTTTATGGTCTTTGGGTCACTGGCTGCTTTC
TTCGTACCTCTCACCATCATGGTAGTCACTTACTTTCTCACCATTCACACTTTACAGAAGAAAGCTTACTT 
GGTCAAAAATAAGCCACCTCAACGCCTAACACGGTGGACTGTGCCCACAGTTTTCCTAAGGGAAGACTCAT 
CCTTTTCATCACCAGAAAAGGTGGCAATGCTGGATGGGTCTCACAGGGATAAAATTCTACCTAACTCAAGT 
GATGAGACACTTATGCGAAGAATGTCCTCAGTTGGAAAAAGATCAGCCCAAACCATTTCTAATGAGCAGAG 
AGCCTCGAAGGCCCTTGGAGTCGTGTTTTTCCTTTTTCTGCTTATGTGGTGCCCCTTTTTTATTACAAATC 
TAACTTTAGCTCTGTGTGATTCCTGCAATCAGACCACTCTCAAAACACTCCTGGAGATATTTGTGTGGATA 
GGCTACGTTTCCTCGGGGGTGAATCCTCTGATCTATACACTCTTCAATAAGACATTTCGGGAAGCATTTGG 
CAGGTACATCACCTGCAATTACCGAGCCACAAAGTCAGTAAAAGCACTTAGGAAGTTTTCCAGTACACTTT 
GTTTTGGGAATTCAATGGTAGAAAACTCTAAATTTTTCACAAAACATGGAATTCGAAATGGGATCAACCCT 
GCCATGTACCAGAGCCCAATGAGGCTCCGATGTTCAACCATTCAGTCCTCATCAATCATCCTCCTCGATAC 
CCTTCTCACTGAAAACGATGGCGACAAAGCGGAAGAGCAGGTCAGCTACATATTGCAGGAACGGGCCGGCC 
TCATCTTGAGAGAGGGTGATGAGCAGGACGCACGCGCACCATGGCAGGTTCAAGAGTGA (SEQ ID NO: 1) 



MASSYKMSEQSTTSEHILQKTCDHLILTNRSGLETDSVAEEMKQTVEGQGHTVHWAALLILAVIIPTIGGN 
ILVILAVALEKRLQYATNYFLMSLAIADLLVGLFVMPIALLTIMFEAIWPLPLALCPAWLFLDVLFSTASI
MHLCAISLDRYIAIKKPIQANQCNTRATAFIKITVVWLISIGIAIPVPIKGIETDVINPHNVTCELTKDRF
GSFMVFGSLAAFFVPLTIMVVTYFLTIHTLQKKAYLVKNKPPQRLTRWTVPTVFLREDSSFSSPEKVAMLD
GSHRDKILPNSSDETLMRRMSSVGKRSAQTISNEQRASKALGVVFFLFLLMWCPFFITNLTLALCDSCNQT
TLKTLLEIFVWIGYVSSGVNPLIYTLFNKTFREAFGRYITCNYRATKSVKALRKFSSTLCFGNSMVENSKF
FTKHGIRNGINPAMYQSPMRLRCSTIQSSSIILLDTLLTENDGDKAEEQVSYILQERAGLILREGDEQDAR 
APWQVQE (SEQ ID NO: 2)



FIGURE 1 



Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



ACTGTCTGGAACTGGACTGAGTCACCAAAAGGCGAATGGCTTCATCTTATAAAATGTCTG 
AACAAAGCACAACTTCTGAGCACATTTTACAGAAGACATGTGATCACCTGATCCTGACTA 

ACCGTTCTG GATTAGAGACAGACTCAGTAGCAGAGGAAATGAAGCAGACTGTGGAGGGAC 
AGGGGCATACAGTGCACTGGGCAGCTCTCCTGATACTCGCGGTGATAATACCCACCATTG 
GTGGGAACATCCTTGTGATTCTGGCTGTTGCACTGGAGAAAAGGCTGCAGTACGCTACCA 
ACTACTTTTTAATGTCCTT GGCGATAGCAGATTTGCTGGTTGGATTGTTTGTGATGCCGA 

TTGCCCTCTTGACAATCATGTTTGAGGCTATATGGCCCCTCCCACTGGCCCTGTGTCCTG 

CCTGGTTATTCCTCGATGTTCTCTTTTCAACTGCCTCCATCATGCATCTCTGTGCCATTT 
CCCTGGACCGCTATATAGCCATCAAAAAGCCAATTCAGGCCAATCAGTGCAACACCCGGG 
CTACTGCATTCATCAAGATTACAGTGGTATGGTTAATTTCAATAGGCATCGCCATCCCAG 
TCCCTATTAAAGGAATCGAGACTGATGTGATTAATCCACACAATGTCACCTGTGAGCTGA
CAAAGGACCGCTTTGGCAGTTTTATGGTCTTTGGGTCACTGGCTGCTTTCTTCGTACCTC 
TCACCATCATGGTAGTCACTTACTTTCTCACCATTCACACTTTACAGAAGAAAGCTTACT 
TGGTCAAAAATAAGCCACCTCAACGCCTAACACGGTGGACTGTGCCCACAGTTTTCCTAA 
GGGAAGACTCATCCTTTTCATCACCAGAAAAGGTGGCAATGCTGGATGGGTCTCACAGGG 
ATAAAATTCTACCTAACTCAAGTGATGAGACACTTATGCGAAGAATGTCCTCAGTTGGAA
AAAGATCAGCCCAAACCATTTCTAATGAGCAGAGAGCCTCGAAGGCCCTTGGAGTCGTGT
TTTTCCTTTTTCTGCTTATGTGGTGCCCCTTTTTTATTACAAATCTAACTTTAGCTCTGT 
GTGATTCCTGCAATCAGACCACTCTCAAAACACTCCTGGAGATATTTGTGTGGATAGGCT 
ACGTTTCCTCGGGGGTGAATCCTCTGATCTATACACTCTTCAATAAGACATTTCGGGAAG 
CATTTGGCAGGTACATCACCTGCAATTACCGAGCCACAAAGTCAGTAAAAGCACTTAGGA 
AGTTTTCCAGTACACTTTGTTTTGGGAATTCAATGGTAGAAAACTCTAAATTTTTCACAA 
AACATGGAATTCGAAATGGGATCAACCCTGCCATGTACCAGAGCCCAATGAGGCTCCGAT 
GTTCAACCATTCAGTCCTCATCAATCATCCTCCTCGATACCCTTCTCACTGAAAACGATG 
GCGACAAAGCGGAAGAGCAGGTCAGCTACATATTGCAGGAACGGGCCGGCCTCATCTTGA
GAGAGGGTGATGAGCAGGACGCACGCGCACCATGGCAGGTTCAAGAGTGA 



FIGURE 2A 
Gene Sequence Structure*

130 bp | Sequence Deleted | 319 bp

Size of full-length cDNA: 1550 bp

Targeting Vector* (genomic sequence)
Construct Number: 2520

5' arm | LacZ-Neo Cassette | 3' arm

Arm Length:
5': 1.6 kb
3': 5 kb

Legend: Targeting Vector; Endogenous Locus

* Not drawn to scale




5'>TGAGTGTCTGGTGGGTTTGCT
AAATGCTTTGCTAAAGCAGATGAC
TTGCTTAGCTACTGACCATGCTGA
CCACTGTCTGGAACTGGACTGAGT
CACCAAAAGGCGAATGGCTTCATC
TTATAAAATGTCTGAACAAAGCAC
AACTTCTGAGCACATTTTACAGAA
GACATGTGATCACCTGATCCTGAC
TAACCGTTCTG<3'
(SEQ ID NO:3)



5'>GGCGATAGCAGATTTGCTGGT
TGGATTGTTTGTGATGCCGATTGC
CCTCTTGACAATCATGTTTGGTGA
GTATTTCCCCTTGTTCCTGCCACT
GAACACTACTAACGTAGTGAAATG
GACACTCACTGACCTTTATTTTGT
TTGAAATAAAAGAAGGACCTGGAT
TAAAAACACAGAAGGGAACATTCC
TTCATTTTTCA<3'
(SEQ ID NO:4) 



FIGURE 2B 



